Arabidopsis promoters

ABSTRACT

The present invention provides several promoters isolated from  Arabidopsis thaliana,  which promoters are capable of driving and/or regulating the expression of an operably linked nucleic acid in a plant. The expression patterns of the promoters according to the present invention have been studied in  Arabidopsis thaliana  and some of the promoters displayed specific activity in particular cells, tissues or organs of the plant, while others displayed constitutive expression throughout substantially the whole plant. Some promoters showed weak expression, while others were strongly active.

The present invention relates to the field of plant molecular biology, more particularly to nucleic acid sequences useful for driving and/or regulating expression of an operably linked nucleic acid in a plant. The isolation of these nucleic acid sequences from Arabidopsis thaliana, as well as their use in driving and/or regulating expression of an operably linked nucleic acid is disclosed. The present invention therefore concerns promoters, hybrid promoters, genetic constructs expression cassettes, transformation vectors, expression vectors, host cells and transgenic plants comprising a promoter according to the present invention. The present invention also concerns methods for driving and/or regulating expression of a nucleic acid and methods for the production of transgenic plants comprising a promoter according to the present invention.

Gene expression is dependent on initiation of transcription, which is mediated via a transcription initiation complex. Regulation of transcription to determine how strong, when or where a gene is expressed may be mediated via transcriptional control elements, which are generally embedded in the nucleic acid sequence 5′-flanking or upstream of the expressed gene. This upstream nucleic acid region is often referred to as a “promoter” since it promotes the binding, formation and/or activation of the transcription initiation complex and is therefore capable of driving and/or regulating expression of the 3′ downstream nucleic acid sequence.

Genetic engineering of plants, aimed at obtaining useful plant phenotypes, often involves heterologous gene expression, which is generally mediated by a promoter capable of driving and/or regulating expression of an operably linked heterologous nucleic acid. The phenotype of the host plant depends not only on the contribution of the heterologous nucleic acid, but also on the contribution of the specific expression pattern of the chosen promoter determining how, where and when that heterologous nucleic acid is expressed. Accordingly, the choice of promoter with a suitable expression pattern is of crucial importance for obtaining the desired plant phenotype. A person skilled in the art will need to have available different promoters, to determine the optimal promoter for a particular (heterologous) nucleic acid. This availability is rather limited and there is therefore a continuing need to provide new promoters with various expression profiles in plants.

The nucleic acids as represented by SEQ ID NO 1 to 9 were isolated from Arabidopsis thaliana and have been found to be capable of driving and regulating expression of an operably linked (heterologous) nucleic acid. Therefore the present invention offers a collection of nucleic acids which have been isolated for the fist time and these nucleic acids have been found to act as promoters and their expression patterns are now disclosed for the first time. It is demonstrated that these isolated nucleic acids are useful as promoters in heterologous gene expression.

Accordingly, the present invention provides an isolated promoter capable of driving and/or regulating expression, comprising:

-   -   (a) an isolated nucleic acid as represented by any one of SEQ ID         NO 1 to 9 or the complement of any one of SEQ ID NO 1 to 9, or     -   (b) an isolated nucleic acid having at least 90% sequence         identity with any one of the DNA sequences as represented by any         one of SEQ ID NO 1 to 9; or     -   (c) an isolated nucleic acid specifically hybridizing under         stringent conditions with any one of the DNA sequences as         represented by any one of SEQ ID NO 1 to 9; or     -   (d) an isolated nucleic acid as defined in any one of (a) to         (c), which is interrupted by an intervening sequence; or     -   (e) a fragment of any one of the nucleic acids as defined in (a)         to (d), which fragment is capable of driving and/or regulating         expression.

The term “isolated” as used herein means being removed from its original source. Preferably, the isolated promoter is free of sequences (such as protein-encoding sequences or other sequences at the 3′ end) that naturally flank the promoter in the genomic DNA of the organism from which the promoter is derived. Further preferably, the isolated promoter is also free of sequences that naturally flank it at the 5′ end. Further preferably, the isolated promoter may comprise less than about 5 kb, 4 kb, 3 kb, 2 kb, 1.5 kb, 1.2 kb, 1 kb, 0.8 kb, 0.5 kb or 0.1 kb of nucleotide sequences that naturally occur with the promoter in genomic DNA from the organism of which the promoter is derived.

The present invention is not limited to the nucleic acids as represented by SEQ ID NO 1 to 9. A person skilled in the art will recognize that variants or fragments of a nucleic acid may occur, whilst maintaining the same functionality. These variants or fragments may occur in nature or may be man made (e.g. by genetic engineering). Therefore the present invention extends to variant nucleic acids and fragments of any one of SEQ ID NO 1 to 9, which variants or fragments are useful in the methods of the present invention. Such variants and fragments include:

-   -   (a) an isolated nucleic acid as represented by any one of SEQ ID         NO 1 to 9 or the complement of any one of SEQ ID NO 1 to 9; or     -   (b) an isolated nucleic acid having at least 90% sequence         identity with any one of the DNA sequences as represented by any         one of SEQ ID NO 1 to 9; or     -   (c) an isolated nucleic add specifically hybridizing under         stringent conditions with any one of the DNA sequences as         represented by any one of SEQ ID NO 1 to 9; or     -   (d) an isolated nucleic add as defined in any one of (a) to (c),         which is interrupted by an intervening sequence; or     -   (e) a fragment of any one of the nucleic adds as defined in (a)         to (d), which fragment is capable of driving and/or regulating         expression.

Suitable variants of any one of SEQ ID NO 1 to 9 encompass homologues which have in increasing order of preference at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with any one of the nude ic acids as represented by SEQ ID NO 1 to 9.

The percentage of sequence identity is calculated using a pairwise global alignment program implementing the algorithm of Needleman Wunsch (J. Mol. Biol. 48: 443 453, 1970), which maximizes the number of match es and minimizes the number of gaps. For calculation of the above-menboned percentages, the program Align X (as part of the Vector NTI suite 5.5) may be used with the standard parameters and the variable parameters gap opening penalty 10 and gap extension penalty 0.1. “Sequence identity” as used herein is preferably calculated over the entire length of the nucleic acid as represented by any one of SEQ ID NO 1 to 9. The length of these nucleic aicds is presented in Table 2.

Search and identification of homologous nucleic adds, would be well within the realm of a person skilled in the art. Such methods, involve screening sequence databases with the sequences provided by the present invention, for example any one of SEQ ID NO 1 to 9, preferably in a computer readable form. Useful sequence databases, include, but are not limited to, Genbank (http:/www.ncbi.nim.nih.gov/web/Genbank), the European Molecular Biology Laboratory Nucleic acid Database (EMBL) (http:/w.ebi.ac.uklebi-docs/embl-db.html) or versions thereof, or the MIPS database (http://mips.gsf.de/). Different search a lgorithms and software for the alignment and comparison of sequences are well known in the art. Such software includes, for example GAP, BESTFIT, BLAST, FASTA and TFASTA. Preferably BLAST software is used, which calculates percent sequence identity and per forms a statistical analysis of the similarity between the sequences. The suite of programs referred to as BLAST programs has 5 different implementations: three designed for nucleotide sequence queries (BLASTN, BLASTX, and TBLASTX) and two designed for protein sequence queries (BLASTP and TBLASTN) (Coulson, Trends in Biotechnology: 76-80, 1994; Birren et al., Genome Analysis, 1: 543, 1997). The software for performing BLAST analysis is publicly available through the National Centre for Biotechnology Information.

The sequences of the genome of Arabidopsis thaliana and the genome of Oryza sativa are now available in public databases such as Genbank. Other genomes are currently being sequenced. Therefore, it is expected that as more sequences of the genomes of other plants become available, homologous promoters may be identifiable by sequence alignment with any one of SEQ ID NO 1 to SEQ ID NO 9. The skilled person will readily be able to find homologous promoters from other plant species, for example from other dicotyledonous plants, such as other members of the Brassicaceae family or from other plant families. Homologous promoters from crop plants are especially useful for practising the methods of the present invention in crop plants.

One example of homologues having at least 90% sequence identity with any one of SEQ ID NO 1 to 9 are allelic variants of any one of SEQ ID NO 1 to 9. Allelic variants are variants of the same gene occurring in two different individuals of the same species and usually allelic variants differ by slight sequence changes. Allelic variants may encompass Single Nucleotide Polymorphisms (SNPs) as well as Small Insertion/Deletion Polymorphisms (INDELs). The size of INDELs is usually less than 100 bp. SNPs and INDELs form the largest set of sequence variants in naturally occurring polymorphic strains of most organisms.

Homologues suitable for use in the methods according to the invention may readily be isolated from their source organism via the technique of PCR or hybridization. Their capability of driving and/or regulating expression may readily be determined, for example, by following the methods described in the Examples section by simply substituting the sequence used in the actual Example with the homologue.

Another variant of any one of SEQ ID NO 1 to 9 encompassed by the present invention is a nucleic acid specifically hybridising under stringent conditions to any one of the nucleic acids of SEQ ID NO 1 to 9. The term hybridising means annealing to substantially homologous complementary nucleotide sequences in a hybridization process. Tools in molecular biology relying on such a hybridization process include the polymerase chain reaction (PCR; and all methods based thereon), subtractive hybridisation, random primer extension, nuclease S1 mapping, primer extension, reverse transcription, cDNA synthesis, differential display of RNAs, and DNA sequence determination, Northern blotting (RNA blotting), Southern blotting (DNA blotting). The hybridisation process can also occur with one of the complementary nucleic acids immobilised to a matrix such as magnetic beads, Sepharose beads or any other resin. Tools in molecular biology relying on such a process include the isolation of poly (A+) mRNA. The hybridisation process can furthermore occur with one of the complementary nucleic adds immobilised to a solid support such as a nitro-cellulose or nylon membrane or immobilised by e.g. photolithography to, for example, a siliceous glass support (the latter known as nucleic acid arrays or microarrays or as nucleic acid chips). Tools in molecular biology relying on such a process include RNA and DNA gel blot analysis, colony hybridisation, plaque hybridisation, in situ hybridisation and microarray hybridisation. In order to allow hybridisation to occur, the nucleic acid molecules are generally thermally or chemically denatured to melt a double strand into two single strands and/or to remove hairpins or other secondary structures from single stranded nucleic adds. The stringency of hybridisation is influenced by conditions such as temperature, salt concentration and hybridisation buffer composition. Conventional hybridisation conditions are described in, for example, Sambrook (2001) Molecular Cloning: a laboratory manual, 3rd Edition Cold Spring Harbor Laboratory Press, CSH, New York, but the skilled craftsman will appreciate that numerous different hybridisation conditions may be designed in function of the known or the expected homology and/or length of the nucleic acid sequence. High stringency conditions for hybridisation include high temperature and/or low sodium/salt concentration (salts include sodium as for example in NaCl and Na₃-citrate) and/or the inclusion of formamide in the hybridisation buffer and/or lowering the concentration of compounds such as SDS (sodium dodecyl sulphate detergent) in the hybridisation buffer and/or exclusion of compounds such as dextran sulphate or polyethylene glycol (promoting molecular crowding) from the hybridisation buffer. Specific hybrisization under stringent conditions is preferably carried out at a temperature of 60° C. followed by washes in 0.1 to 1×SSC, 0.1×SDS, and 1×SSC, 0.1× SDS. Sequences capable of specifically hybridising under stringent conditions are sequences that are very similar.

The invention also relates to a nucleic acid molecule of at least 15 nucleotides in length hybridizing specifically with any one of the nucleic acids of the invention. The invention also relates to a nucleic acid molecule of at least 15 nucleotides in length specifically amplifying a nucleic acid of the invention by polymerase chain reaction.

Another variant of any one of SEQ ID NO 1 to 9 encompassed by the present invention is a variant, which is interrupted by an intervening sequence. For example, any one of the nucleic adds as represented by SEQ ID NO 1 to 9 may be interrupted by an intervening sequence. With “intervening sequence” is meant any nucleic acid or nucleotide, which disrupts another sequence.

Examples of intervening sequences include introns, nucleic acid tags, T-DNA and mobilizable nucleic acids sequences such as transposons or nucleic adds that may be mobilized via recombination. Examples of particular transposons comprise Ac (activator), Ds (Dissociation), Spm (suppressor-Mutator) or En. The introduction of introns into promoter s is now widely applied. The methods according to the present invention may also be practised using a nucleic acid sequence according to any one of SEQ ID NO 1 to 9 provided with an intron. In case the intervening sequence is an intron, alternative splice variants of the nucleic acids according to the invention may arise. The term “alternative splice variant” as used herein encompasses variants of a nucleic add sequence in which intervening introns have been excised, replaced or added. Such splice variants may be found in nature or may be manmade. Methods for making such promoters with an intron or for making the corresponding splice variants are well known in the art

Variants interrupted by an intervening sequence, suitable for use in the methods according to the invention may readily be determined for example by following the methods described in the Examples section by simply substituting the sequence used in the actual Example with the variant.

The variant nucleic adds as described hereinabove may be found in nature (for example allelic variants or splice variants). Additionally and/or alternatively, variants of any one of SEQ ID NO 1 to 9 as described hereinabove may be manmade via techniques well known in the art involving for example mutation, substitution, insertion, deletions or derivation. The present invention also encompasses such variants, as well as their use in the methods of the present invention.

A “mutation variant” of a nucleic add may readily be made using recombinant DNA manipulation techniques or nucleotide synthesis. Examples of such techniques include site directed mutagenesis via M13 mutagenesis, T7-Gen in vitro mutagenesis (USB, Cleveland, Ohio), QuickChange Site Directed mutagenesis (Stratagene, San Diego, Calif.), PCR-mediated site-directed mutagenesis or other site-directed mutagenesis protocols. Alternatively, the nucleic acid of the present invention may be randomly mutated, for example by “error prone PCR”.

A “substitutional variant” refers to those variants in which at least one residue in the nucleic acid sequence has bee n removed and a different residue inserted in its place. Nucleic acid substitutions are typically of single residues, but may be clustered depending upon functional constraints placed upon the nucleic acid sequence; insertions usually are of the order of about 1 to about 10 nucleic acid residues, and deletions can range from about 1 to about 26 residues.

An “insertional variant” of a nucleic acid is a variant in which one or more nucleic acid residues are introduced into a predetermined site in that nucleic acid. Insertions may comprise 5′-terminal and/or 3′-terminal fusions as well as intra-sequence insertions of single or multiple nucleotides. Generally, insertions within the nucleic acid sequence will be smaller than 5′- or 3′-terminal fusions, of the order of about 1 to 10 residues. Examples of 5′- or 3′-terminal fusions include the coding sequences of binding domain s or activation domains of a transcriptional activator as used in the yeast two-hybrid system or yeast one-hybrid system, or of phage coat proteins, (histidine)₆-tag, glutathione S-transferase-tag, protein A, maltose-binding protein, dihydrofolate reductase, Tag•100 epitope, c-myc epitope, FLAG-epitope, lacZ, CMP (calmodulin-binding peptide), HA epitope, protein C epitope and VSV epitope.

The term “derivative” of a nucleic acid may comprise substitutions, and/or deletions and/or additions of naturally and non-naturally occurring nucleic acid residues compared to the natural nucleic acid. Derivatives may, for example, comprise methylated nucleotides, or artificial nucleotides.

Also encompassed by the present invention are fragments of the nucleic acids as represented by any one of SEQ ID NO 1 to 9 or variants thereof as described hereinabove. A “fragment” as used herein means a portion of a nucleic acid sequence. Suitable fragments useful in the methods of the present invention are functional fragments, which retain at least one of the functional parts of the promoter and hence are still capable of driving and/or regulating expression. Examples of functional fragments of a promoter include the minimal promoter, the upstream regulatory elements, or any combination thereof.

Suitable fragments may range from at least about 20 base pairs to about 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950 or 1000 base pairs, up to about the full length sequence of the invention. These base pairs are typically immediately upstream of the transcription initiation start, but alternatively may be from anywhere in the promoter sequence.

Suitable means to find functional fragments (i.e. fragments comprising regulatory regions, regulatory elements, cis-elements, boxes or minimal promoter) include software programs designed for motif searching. Preferred computer programs include MEME, SIGNALSCAN, and GENESCAN. A MEME algorithm (Version 2.2) is found in version 10.0 of the GCG package; or on the internet site http://www.sdsc.edu/MEME/meme. SIGNALSCAN version 4.0 information is available on the internet site http://biosci.cbs.umn.edu/software/sigscan.html. GENESCAN may be found on the internet site http://gnomic.stanford.edu/GENESCANW.html.

Suitable fragments useful in the methods of the present invention may be tested for their capability of driving and/or regulating expression by standard techniques well known to the skilled person, or by the following method described in the Example section.

The promoters as disclosed in any one of SEQ ID NO 1 to 9 are isolated as nucleic acids of approximately 1.2 kb from the upstream region of particular Arabidopsis coding sequences (CDS). These nucleic acids may include typical elements of a promoter, which are presented in FIG. 1. Generally a promoter may comprises from coding sequence into the upstream direction: (i) a 5′UTR of pre-messenger RNA, (ii) a minimal promoter comprising the transcription initiation element (INR) and, more upstream, a TATA box, and (iii) may contain regulatory elements that determine the specific expression pattern of the promoter.

The term “promoter” as used herein is taken in a broad context and refers to regulatory nucleic acid sequences capable of effecting (driving and/or regulating) expression of the sequences to which they are operably linked. A “promoter” encompasses transcriptional regulatory sequences derived from a classical genomic gene. Usually a promoter comprises a TATA box, which is capable of directing the transcription initiation complex to the appropriate transcription initiation start site. However, some promoters do not have a TATA box (TATA-less promoters), but are still fully functional for driving and/or regulating expression. A promoter may additionally comprise a CCAAT box sequence and additional regulatory elements (i.e. upstream activating sequences or cis-elements such as enhancers and silencers). A “promoter” may also include the transcriptional regulatory sequences of a classical prokaryotic gene, in which case it may include a −35 box sequence and/or a −10 box transcriptional regulatory sequences.

“Driving expression” as used herein means promoting the transcription of a nucleic acid.

“Regulating expression” as used herein means influencing the level, time or place of transcription of a nucleic acid. The promoters of the present invention may thus be used to increase, decrease or change in time and/or place transcription of a nucleic acid. For example, they may be used to limit the transcription to certain cell types, tissues or organs, or during a certain period of time, or in response to certain environmental conditions.

The promoter is preferably a plant-expressible promoter. The term “plant-expressible” means being capable of regulating expression in a plant, plant cell, plant tissue and/or plant organ. Accordingly, the invention encompasses an isolated nucleic acid as mentioned above, capable of regulating transcription of an operably linked nucleic acid in a plant or in one or more particular cells, tissues or organs of a plant.

The expression pattern of the promoters according to the present invention was studied in detail and it was found that many of them were tissue-specific. Accordingly, the present invention lo provides “tissue-specific” promoters. The term “tissue-specific” shall be taken to indicate that expression is predominantly in a particular tissue, tissue-type, organ or any other part of the organism, albeit not necessarily exclusively in said tissue, tissue-type, organ or other part. Accordingly, the invention encompasses an isolated nucleic acid as mentioned above, capable of driving and/or regulating expression (of an operably linked nucleic acid) in a tissue-specific manner. Expression may be driven and/or regulated for example in the root, root meristem, root tip, lateral roots, root central cylinder, or in hypocotyls, cotyledons, meristem, shoot, shoot meristem, leaves, trichomes, hydathodes, apical meristem, flowers, petals, pedicle, stamen, siliques, seed, embryo.

A tissue-specific promoter is one example of a so-called “regulated promoter”. These promoters are regulated by endogenous signals such as the presence of certain transcription factors, metabolites, plant hormones, or exogenous signals, such as ageing, stresses or nutritional status. These regulations may have an effect on one or more different levels such spatial specificity or temporal specificity. Encompassed within the present invention is a nucleic acid as described hereinabove, which is a “regulated promoter”. Examples of regulated promoters are cell-specific promoters, tissue-specific promoters, organ-specific promoters, inducible promoters or young tissue-specific promoters.

Alternatively and/or additionally, some promoters of the present invention display a constitutive expression pattern. Accordingly, the present invention provides a promoter as described hereinabove, which is a constitutive promoter. The term “constitutive” means having no or very few spatial or temporal regulation. The term “constitutive expression” as used herein refers to substantially continuously expression in substantially all tissues of the organism. The skilled craftsman will understand that a “constitutive promoter” is a promoter that is active during most, but not necessarily all, phases of growth and development of the organism and throughout most, but not necessarily all, parts of an organism.

The “expression pattern” of a promoter is no t only influenced by the spatial and temporal aspects, but also by the level of expression. The level of expression is determined by the so-called “strength” of a promoter. Depending on the resulting expression level, a distinction is made herein between “weak” and “strong” promoters. Generally by “weak promoter” is meant a promoter that drives expression of an operably linked nucleic, add at levels of about 1/10000 transcripts, to about 1/100000 transcripts or to about 1/500000 transcripts. Generally, by “strong promoter” is meant a promoter that drives expression at levels of about 1/10 transcripts, to about 1/100 or to about 1/1000 transcripts.

According to a particular embodiment, the invention provides an isolated promoter as mentioned hereinabove, which is a hybrid promoter. The term “hybrid promoter” as used herein refers to a chimeric promoter made, for example, synthetically, for example by genetic engineering. Preferred hybrid promoters according to the present invention comprise a part, preferably a functional part, of one of the promoters according to the present invention and at least one part of another promoter. Alternatively, hybrid promoters encompassed by the present invention comprise another part of the same promoter. One example of a hybrid promoter comprises regulatory element(s) of a promoter according to the present invention combined with the minimal promoter of another promoter. Another example of a hybrid promoter is a promoter comprising additional regulatory elements to further enhance its activity and/or to alter its spatial and/or temporal expression pattern.

The present invention also provides use of a functional fragment of any one of SEQ ID NO 1 to 9 or variant thereof for changing the expression pattern of a promoter. In such methods, at least part of any of the nucleic acids according to the present invention are combined with at least one fragment of another promoter. According to a particular embodiment the invention encompasses a method for conferring tissue-specificity, and/or constitutive expression to a promoter sequence, comprising the fusion of a promoter according to the present invention or at least a functional fragment thereof, to that promoter sequence normally not exhibiting that tissue specificity and/or constitutive expression. Such modifications and fusions may be achieved by routine experimentation by those skilled in the art.

Further, the invention provides a genetic construct comprising:

-   -   (a) an isolated promoter as defined hereinabove; and     -   (b) a heterologous nucleic acid sequence operably linked to the         isolated promoter of (a); and optionally     -   (c) a 3′ transcription terminator.

The term “genetic construct” as used herein means a nucleic acid made by genetic engineering.

The term “operably linked” to a promoter as used herein means that the transcription is driven and/or regulated by that promoter. A person skilled in the art will understand that being operably linked to a promoter preferably means that the promoter is positioned upstream (i.e. at the 5′-end) of the operably linked nucleic acid. The distance to the operably linked nucleic acid may be variable, as long as the promoter of the present invention is capable of driving and/or regulating the transcription of the operably linked nucleic acid. For example, between the promoter and the operably linked nucleic acid, there might be a cloning site, an adaptor, a transcription or translation enhancer.

The operably linked nucleic acid may be any coding or non-coding nucleic acid. The operably linked nucleic acid may be in a sense or in anti-sense direction. Typically in the case of genetic engineering of host cells, the operably linked nucleic acid is to be introduced into the host cell and is intended to change the phenotype of the host cell. Alternatively, the operably linked nucleic acid is an endogenous nucleic acid from the host cell.

The term “heterologous” as used herein is intended to be “heterologous to the promoter of the present invention”. A nucleic acid that is heterologous to the promoter of the present invention is one that is not the naturally occurring nucleic acid sequence flanking the promoter of the present invention when it is in its biological genomic environment. While the nucleic acid may be heterologous to the promoter of the present invention, it may be homologous or native or heterologous or foreign to the plant host cell. The heterologous operably linked nucleic acid may be any nucleic acid (for example encoding any protein), provided that it comprises or it is flanked by at least one nucleotide which is normally not flanking the promoter of the present invention.

The term “transcription terminator” as used in (c) refers to a DNA sequence at the end of a transcriptional unit which signals termination of transcription. Terminators are 3′-non-translated DNA sequences usually containing a polyadenylation signal, which facilitates the addition of polyadenylate sequences to the 3′-end of a primary transcript. Terminators active in and/or isolated from viruses, yeasts, moulds, bacteria, insects, birds, mammals and plants are known and have been described in literature. Examples of terminators suitable for use in the genetic constructs of the present invention include the Agrobacterium tumefaciens nopaline synthase (NOS) gene terminator, the Agrobacterium tumefaciens octopine synthase (OCS) gene terminator sequence, the Cauliflower mosaic virus (CaMV) 35S gene terminator sequence, the Oryza sativa ADP-glucose pyrophosphorylase terminator sequence (t3′Bt2), the Zea mays zein gene terminator sequence, the rbcs-1A gene terminator, and the rbcs-3A gene terminator sequences, amongst others.

The present invention also provides an expression cassette, a transformation vector and a plant expression vector comprising a genetic construct as described above.

An “expression cassette” as meant herein refers to a minimal genetic construct necessary for expression of a nucleic acid. A typical expression cassette comprises a promoter gene-terminator combination. An expression cassette may additionally comprise cloning sites, for example Gateway™ recombination sites or restriction enzyme recognition sites, to allow easy cloning of the operably linked nucleic acid to the promoter of the present invention or to allow the easy transfer of the expression cassette into a vector. An expression cassette may further comprise 5′ untranslated regions, 3′ untranslated regions, a selectable marker, transcription enhancers or translation enhancers.

With “transformation vector” is meant a genetic construct, which may be introduced in an organism by transformation and may be stably maintained in said organism. Some vectors may be maintained in, for example Escherichia coli, A. tumefaciens, Saccharomyces cerevisiae or Schizosaccharomyces pombe, while others such as phagemids and cosmid vectors, may be maintained in bacteria and/or viruses. Transformation vectors may be multiplied in their host cell and may be isolated again therefrom to be transformed into another host cell. Vector sequences generally comprise a set of unique sites recognized by restriction enzymes, the multiple cloning site (MCS), wherein one or more non-vector sequence(s) may be inserted. Vector sequences may further comprise an origin of replication which is required for maintenance and/or replication in a specific host cell. Examples of origins of replication include, but are not limited to, the f1-ori and colE1.

“Expression vectors” form a subset of transformation vectors, which, by virtue of comprising the appropriate regulatory sequences, enable expression of the inserted non-vector sequence(s). Expression vectors have been described which are suitable for expression in bacteria (e.g. E. coli), fungi (e.g. S. cerevisiae, S. pombe, Pichia pastoris), insect cells (e.g. baculoviral expression vectors), animal cells (e.g. COS or CHO cells) and plant cells. One suitable expression vector according to the present invention is a plant expression vector, useful for the transformation of plant cells, the stable integration in the plant genome, the maintenance in the plant cell and the expression of the non-vector sequences in the plant cell.

Typically, a plant expression vector according to the present invention comprises a nucleic acid of any one of SEQ ID NO 1 to 9 or a variant thereof as described hereinabove, optionally operably linked to a second nucleic acid. Typically, a plant expressible vector according to the present invention, further comprises T-DNA regions for stable integration into the plant genome (for example the left border and the right border regions of the Ti plasmid).

The genetic constructs of the invention may further comprise a “selectable marker”. As used herein, the term “selectable marker” includes any gene, which confers a phenotype to a cell in which it is expressed, to facilitate the identification and/or selection of cells that are transfected or transformed. Suitable markers may be selected from markers that confer antibiotic or herbicide resistance. Cells containing the genetic construct will thus survive antibiotics or herbicide concentrations that kill untransformed cells. Examples of selectable marker genes include genes conferring resistance to antibiotics (such as nptII encoding neomycin phosphotransferase capable of phosphorylating neomycin and kanamycin, or hpt encoding hygromycin phosphotransferase capable of phosphorylating hygromycin), to herbicides (for example bar which provides resistance to Basta; aroA or gox providing resistance against glyphosate), or genes that provide a metabolic trait (such as manA that allows plants to use mannose as sole carbon source). Visual marker genes result in the formation of colour (for example beta-glucuronidase, GUS), luminescence (such as luciferase) or fluorescence (Green Fluorescent Protein, GFP, and derivatives thereof). Further examples of suitable selectable marker genes include the ampicillin resistance (Ampr), tetracycline resistance gene (Tcr), bacterial kanamycin resistance gene (Kanr), phosphinothricin resistance gene, and the chloramphenicol acetyltransferase (CAT) gene, amongst others.

Furthermore, the present invention encompasses a host cell comprising an isolated promoter, or a genetic construct, or an expression cassette, or a transformation vector or an expression vector according to the invention as described hereinabove. In particular embodiments of the invention, the host cell is selected from bacteria, algae, fungi, yeast plant s, insect or animal host cells.

In one particular embodiment, the invention provides a transgenic plant cell comprising an isolated promoter according to the invention, or an isolated nucleic acid, or a genetic construct, or an expression cassette, or a transformation vector or an expression vector according to the invention as described hereinabove. Preferably said plant cell is a dicot plant cell or a monocot plant cell, more preferably a cell of any of the plants as mentioned herein. Preferably, in the transgenic plant cell according to the invention, the promoter or the genetic construct of the invention is stably integrated into the genome of the plant cell.

The invention also provides a method for the production of a transgenic plant, comprising:

-   -   (a) introducing into a plant cell an isolated promoter according         to the invention, for example as represented by any one of SEQ         ID NO 1 to SEQ ID NO 9, or a variant or fragment thereof, or a         genetic construct, or an expression cassette, or a         transformation vector or an expression vector according to the         present invention and as described hereinabove, and     -   (b) cultivating said plant cell under conditions promoting plant         growth.

“Introducing” the above mentioned isolated promoter, or genetic construct or expression cassette, or transformation vector or expression vector, into a host cell (e.g. plant cell) is preferably achieved by transformation. The term “transformation” as used herein encompasses the transfer of an exogenous polynucleotide into a host cell, irrespective of the method used for transfer. In particular for plants, tissues capable of clonal propagation, whether by organogenesis or embryogenesis, are suitable for transformation with a genetic construct of the present invention and a whole plant may be regenerated therefrom. The particular tissue chosen will vary depending on the clonal propagation systems available for, and best suited to, the particular plant species being transformed. Exemplary tissue targets include leaf disks, pollen, embryos, cotyledons, hypocotyls, megagametophytes, callus tissue, existing meristematic tissue (e.g. apical meristem, axillary buds, and root meristems), and induced meristem tissue (e.g. cotyledon meristem and hypocotyl meristem). The polynucleotide may be transiently or stably introduced into a plant cell and may be maintained non-integrated, for example as a plasmid. Alternatively, it may be integrated into the plant genome.

Transformation of a plant species is now a fairly routine technique. Advantageously, any of several transformation methods may be used to introduce the nucleic acid s of the invention into a suitable ancestor cell. Transformation methods include the use of liposomes, electroporation, chemicals that increase free DNA uptake, injection of the DNA directly in to the plant, particle gun bombardment, transformation using viruses or pollen and microprojection. Methods may be selected from the calcium/polyethylene glycol method for protoplasts (Krens, F. A. et al., 1882, Nature 296, 72-74; Negrutiu I. et al., June 1987, Plant Mol. Biol. 8, 363-373); electroporation of protoplasts (Shillito R. D. et al., 1985 Bio/Technol 3, 1099-1102); microinjection into plant material (Crossway A. et al., 1986, Mol. Gen Genet 202, 179-185); DNA or RNA-coated particle bombardment (Klein T. M. et al., 1987, Nature 327, 70) infection with (non-integrative) viruses and the like. A preferred transformation method for the production of transgenic plant cells according to the present invention is an Agrobacterium mediated transformation method.

Transgenic rice plants comprising any one of the promoters of the present invention are preferably produced via Agrobacterium-mediated transformation using any of the well-known methods for rice transformation, such as the ones described in any of the following: published European patent application EP 1198985 A1. Aldemita and Hodges (Planta, 199, 612-617, 1996); Chan et al. (Plant Mol. Biol. 22 (3) 491-506, 1993); Hiei et al. (Plant J. 6 (2) 271-282, 1994); which disclosures are incorporated by reference herein as if fully set forth. In the case of corn transformation, the preferred method is as described in either Ishida et al. (Nat. Biotechnol. June 1996; 14(6): 745-50) or Frame et al. (Plant Physiol. May 2002; 129(1): 13-22), which disclosures are incorporated by reference herein as if fully set forth.

Generally after transformation, plant cells or cell groupings are selected for the presence of one or more markers which are encoded by plant-expressible genes co-transferred with the nucleic acid of interest, following which the transformed material may be cultivated under conditions promoting plant growth.

The resulting transformed plant cell may then be used to regenerate a transformed plant in a manner known to persons skilled in the art. Accordingly, the method for the production of a transgenic plant as described hereinabove, may further comprise regenerating a plant from said plant cell of (a).

The present invention further provides a plant comprising a plant cell as described hereinabove. The plants may also be able to grow, or even reach maturity including for example fruit production, seed formation, seed ripening and seed setting.

Furthermore, progeny may be produced from these seeds, which progeny may be fertile. Alternatively or additionally, the transformed and regenerated plants may also produce progeny by non-sexual propagation such as cloning, grafting. The generated transformed plants may be propagated by a variety of means, such as by clonal propagation or classical breeding techniques. For example, a first generation (or T1) transformed plant may be selfed to give homozygous second generation (or T2) transformants, and the T2 plants further propagated through classical breeding techniques. The generated transformed organisms may take a variety of forms. For example, they may be chimeras of transformed cells and non-transformed cells; clonal transformants (e.g., all cells transformed to contain the expression cassette); grafts of transformed and untransformed tissues (e.g., in plants, a transformed rootstock grafted to an untransformed scion).

Following DNA transfer and growth of the transformed cells, putatively transformed plant cells or plants may be evaluated, for instance using Southern analysis, for the presence of the gene of interest, copy number and/or genomic organization. Alternatively or additionally, expression levels or expression patterns of the newly introduced DNA may be undertaken using northern and/or Western analysis, both techniques being well known to persons having ordinary skill in the art.

The present invention dearly extends to plants obtainable by any of the methods according to the present invention, which plants comprise any of the isolated promoters or the constructs of the present invention. The present invention clearly extends to any plant parts and propagules of such plant. The present invention extends further to encompass the progeny of a primary transformed cell, tissue, organ or whole plant that has been produced by any of the aforementioned methods, the only requirement being that progeny exhibit the same genotypic and/or phenotypic characteristic(s) as those produced in the parent by the methods according to the invention. The invention also extends to harvestable parts of a plant, such as but not limited to seeds, leaves, fruits, flowers, stem cultures, stem, rhizomes, roots, tubers, bulbs and cotton fibers.

The term “plant” or “plants” as used herein encompasses whole plants, ancestors and progeny of plants and plant parts, including seeds, shoots, stems, roots (including tubers), and plant cells, tissues and organs. The term “plant” therefore also encompasses suspension cultures, embryos, meristemalic regions, callus tissue, gametophytes, sporophytes, pollen, and microspores. Plants that are particularly useful in the methods of the invention include all plants which belong to the superfamily Vridiplantae, in particular monocotyledonous and dicotyledonous plants including a fodder or forage legume, ornamental plant, food crop, tree, or shrub selected from the list comprising Acacia spp., Acer spp., Actinidia spp., Aesculus spp., Agathis australis, Albizia amara, Alsophila tricolor, Andropogon spp., Arachis spp, Areca catechu, Astelia fragrans, Astragalus cicer, Baikiaea plurijuga, Betula spp., Brassica spp., Bruguiera gymnorrhiza, Burkea africana, Butea frondosa, Cadaba farinosa, Calliandra spp, Camellia sinensis, Canna indica, Capsicum spp., Cassia spp., Centroema pubescens, Chaenomeles spp., Cinnamomum cassia, Coffea arabica, Colophospermum mopane, Coronillia varia, Cotoneaster serotina, Crataegus spp., Cucumis spp., Cupressus spp., Cyathea dealbata, Cydonia oblonga, Cryptomeria japonica, Cymbopogon spp., Cynthea dealbata, Cydonia oblonga, Dalbergia monetaria, Davallia divaricata, Desmodium spp., Dicksonia squarosa, Diheteropogon amplectens, Dioclea spp, Dolichos spp., Dorycnium rectum, Echinochloa pyramidalis, Ehrartia spp., Eleusine coracana, Eragrestis spp., Erythrina spp., Eucalyptus spp., Euclea schimperi, Eulaliav illosa, Fagopyrum spp., Feijoa sellowiana, Fragaria spp., Flemingia spp, Freycinetia banksii, Geranium thunbergii, Ginkgo biloba, Glycine javanica, Gliricidia spp, Gossypium hirsutum, Grevillea spp., Guibourtia coleosperma, Hedysarum spp., Hemarthia altissima, Heteropogon contortus, Hordeum vulgare, Hyparrhenia rufa, Hypericum erectum, Hypertheria dissolute, Indigo incamate, Iris spp., Leptarrhena pyrolifolia, Lespediza spp., Lettuca spp., Leucaena leucocephala, Loudetia simplex, Lotonus bainesii, Lotus spp., Macrotyloma axillare, Malus spp., Manihot esculenta, Medicago sativa, Metasequoia glyptostroboides, Musa sapientum, Nicotianum spp., Onobrychis spp., Omithopus spp., Oryza spp., Peltophorum africanum, Pennisetum spp., Persea gratissima, Petunia spp., Phaseolus spp., Phoenix canariensis, Phormium cookianum, Photinia spp., Picea glauca, Pinus spp., Pisum sativum, Podocarpus totara, Pogonarthria fleckii, Pogonarthria squanrosa, Populus spp., Prosopis cineraria, Pseudotsuga menziesii, Pterolobium stellatum, Pyrus communis, Quercus spp., Rhaphiolepsis umbellate, Rhopalostylis sapida, Rhus natalensis, Ribes grossularia, Ribes spp., Robinia pseudoacacia, Rosa spp., Rubus spp., Salix spp., Schyzachyrium sanguineum, Sciadopitys verticillata, Sequoia sempervirens, Sequoiadendron giganteum, Sorghum bicolor, Spinacia spp., Sporobolus fimbriatus, Stiburus alopecuroides, Stylosanthos humilis, Tadehagi spp, Taxodium distichum, Themeda triandra, Trifolium spp., Triticum spp., Tsuga heterophylla, Vaccinium spp., Vicia spp. Vitis viniera, Watsonia pyramidata, Zantedeschia aethiopica, Zea mays, amaranth, artichoke, asparagus, broccoli, brussel sprout, cabbage, canola, carrot, cauliflower, celery, collard greens, flax, kale, lentil, oilseed rape, okra, onion, potato, rice, soybean, straw, sugarbeet, sugar cane, sunflower, tomato, squash, and tea, trees and algae amongst others. According to a preferred feature of the present invention, the plant is a crop plant such as soybean, sunflower, canola, alfalfa, rapeseed, cotton, tomato, potato, tobacco, squash, papaya, poplar, leguminosa, fax, lupinus or sorghum. According to another preferred embodiment of the present invention the plant is a monocotyledonous plant, such as sugarcane, further preferable a cereal such as rice, maize, wheat, barley, millet, rye or oats.

The invention further provides a method for driving and/or regulating expression of a nucleic acid in a plant or plant cell, comprising:

-   -   (a) operably linking a nucleic acid to a promoter according to         the invention as described hereinabove, such as to any one of         SEQ ID NO 1 to 9 or a variant thereof, and     -   (b) introducing the resultant genetic construct into a plant or         plant cell.

Preferably the operably linked nucleic acid of (a) is heterologous to the nucleic acids according to the present invention.

Preferably the genetic construct is stably introduced into the plant or plant cell.

This method may further comprise cultivating the transformed plant or plant cell under conditions promoting growth, promoting regeneration and/or promoting maturation.

Furthermore, the expression of the operably linked nucleic acid may be driven and/or regulated in particular cells, tissues or organs of a plant. Accordingly, the invention provides a method as described above, wherein the expression is constitutive expression or tissue-specific expression. For these embodiments, reference is made to the example section where the specific expression patterns of the promoters according to the invention are described and where different types of tissue-specific expression are detailed.

The present invention further encompasses the use of an isolated promoter as defined hereinabove to drive and/or regulate expression of an operably linked nucleic acid.

The person skilled in the art will recognize that provision of sequences SEQ ID NO 1 to 9, readily makes available the tools to isolate related promoters, which may have substantial sequence identity to any one of SEQ ID NO 1 to 9. Additionally, provision of sequences SEQ ID NO 10 to 18 (CDS corresponding to the promoters of the present invention, see Table 1), readily makes available the tools to isolate related promoters, of which the related CDSs may have substantial sequence identity to any one of SEQ ID NO 10 to 18. Therefore the present invention also encompasses a method for isolating nucleic acids, capable of driving and/or regulating expression of an (heterologous) operably linked nucleic acid, comprising screening a nucleic acid sequence database to find homologues of any of the sequences represented by SEQ ID NO 1 to 9 or SEQ ID NO 10 to 18. Subsequently synthetic nucleic acid that correspond to the sequence of these homologues are used to screen a library with genomic DNA, which library is for example prepared from the organism of origin of the above mentioned homologue. The screening procedure may for example involve hybridization. Subsequently, the genomic DNA that matches the homologue, is analysed to identify the transcription initiation site and the translation initiation site of the gene corresponding to the homologue. Finally, specific primers are designed for amplification of a nucleic acid located in the region upstream (at the 5′ end) of said translation initiation site.

The present invention also extends to the identification of regulatory proteins that are involved in the regulation of the activity of the promoters according to the present invention. Such identification may be achieved using a yeast one-hybrid system. In such a yeast one-hybrid system the sequences according to any one of SEQ ID NO 1 to 9 are operably linked to the GAL transcription activator and transformed into yeast cells. These yeast cell are again transformed with a library of constructs encoding candidate regulatory factors.

The present invention will now be described with reference to the following figures in which:

FIG. 1 shows a general schematic representation of a promoter. Regulatory elements are sequences that may for example be responsible for special and/or temporal regulation of the promoter activity. The minimal promoter is the minimal sequence necessary and sufficient to drive expression. It includes a TATA box, which is necessary to correctly direct the RNA polymerase II to the transcription initiation site. The transcription initiation element (INR) includes the transcription initiation start site. The 5′ untranslated region (5′UTR) is the region that is transcribed into pre-messenger RNA and eventually in to mRNA, but is not translated into protein. The translation initiation codon is represented by the startcodon ATG.

FIG. 2 is a map of the vector p4582 useful for expression in plants of β-glucuronidase (GUS) gene under control of any one of the promoters according to the invention. Th is binary vector comprises a Gateway recombination cassette, suitable for the recombination cloning of any of the promoters of the present invention in front of the Escherichia coli β-glucuronidase (GUS) gene. This cassette contains a chloramphenicol resistance gene (CamR) and the ccdB suicide gene for counter selection of non-recombined plasmids. This GUS expression cassette further comprises the double terminator sequence T-zein and T-rbcS-deltaGA. This expression cassette is located within the left border (LB repeat, LB Ti C58) and the right border (RB repeat, RB Ti C58) of the nopaline Ti plasmid. Cloned within these borders are also selectable marker and a screenable marker genes each under control of a constitutive promoter and a terminator sequence. This vector also contains an origin of replication (pBR322) for bacterial replication and a bacterial selectable marker (Spe/SmeR) for bacterial selection.

The following figures show the results of the GUS staining of plants or plant parts transformed with the reporter vector p4582 carrying a promoter according to the present invention operably linked to the reporter gene GUS. Plants denoted “C plants” are transgenic plants grown to about 5 cm; Plants denoted “B plants” are grown to about 10 cm; and plants denoted “A plants” are grown to maturity. These A plants were used to collect different tissue samples from old leaves, young leaves and seeds.

FIG. 3 shows the expression pattern of PRO0162 (fructose bi-phopshate aldolase, SEQ ID NO 1). GUS staining is visible in all parts of the plant.

FIG. 4 shows the expression pattern of PRO0143 (Cyclin D2, SEQ ID NO 2). GUS staining is visible in hydathodes and shoot meristem.

FIG. 5 shows the expression pattern of PRO0144 (Cyclin D3, SEQ ID NO 3). GUS staining is visible in actively dividing tissues, root cylinder, hydathodes and seeds.

FIG. 6 shows the expression pattern of PRO0161 (rubisco activase, SEQ ID NO 4). GUS staining is visible in shoots, flowers and siliques. GUS staining is also weakly visible in roots and embryos.

FIG. 7 shows the expression pattern of PRO0183 (putative extensin, SEQ ID NO 5). GUS staining is visible in roots. There is also very weak GUS staining visible in flowers and siliques.

FIG. 8 shows the expression pattern of PRO0185 (12S cruciferin AtCRU3, SEQ ID NO 6). GUS staining is visible in seeds.

FIG. 9 shows the expression pattern of PRO0190 (putative protein, SEQ ID NO 7). GUS staining is visible in trichomes of young developing leaves.

FIG. 10 shows the expression pattern of PRO0193 (FAD2 desaturase, SEQ ID NO 8). GUS staining is visible in young developing tissues, including seeds.

FIG. 11 shows the expression pattern of PRO0194 (G3PDH-like, SEQ ID NO 9). GUS staining is visible in young developing tissues and seeds.

EXAMPLES

The promoters according to the present invention were isolated as DNA regions spanning about 1.2 kb of the sequence upstream of the translation initiation codon (i.e. first ATG, which codon was excluded) from various Arabidopsis genes. For determination of their nucleic acid sequence and their expression pattern, the following procedure was followed: First in silico studies on genomic Arabidopsis sequences were performed. However, procedures based on automated prediction programs to locate promoter like nucleic acid sequence are highly error prone, even for the localization the best-characterized promoter control elements such as the TATA box and the transcription initiation element (INR). Also, in silico determination of expression pattern is extremely speculative. Therefore, to obtain unambiguous data about the nucleic acid sequence and the expression pattern of the promoters, in vivo studies were performed encompassing (i) isolation of the promoter nucleic acid sequence; (ii) operably linking a reporter gene to the promoter and introducing the resulting genetic construct into a host organisms (iii) growing the transformed host cell under conditions allowing expression of the reporter gene; and (iv) determination of the reporter gene activity in the different tissues of the host organism. These methods are now described in more detail.

Example 1 Identification and Isolation of the Arabidopsis Promoters

Identification of ESTs, the Corresponding Genes and their Location in the Genome

Sequence databases, comprising Arabidopsis sequences, were searched for Arabidopsis expressed sequence tags (ESTs). Subsequently an “in silico” Northern-blot was performed to allow identification of EST families that are strongly expressed or that are specific for a particular organ. This analysis included normalization of the numbers of ESTs isolated from different plant organs. The ESTs families with an interesting distribution among source cDNA libraries were selected for further analysis and sequence homology searches. After sequence homology searches in combination with scanning scientific data, the genes that correspond to those families of ESTs were identified from sequence databases and a (putative) function and corresponding gene name was given (see Table 1). Subsequently, the corresponding promoter region was isolated by the following procedure. In a first step the TIGR Arabidopsis transcribed sequence database was searched to find a tentative contig corresponding to an EST family (http://www.tiqr.org/, The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, Md. 20850). Sequence homology was found using standard computer programs, such as Blast N using standard parameters (typically G Cost to open a gap=5, E Cost to extend a gap=2, q Penalty for a mismatch in the blast portion of run=−3, r Reward for a match in the blast portion of run=1, e Expectation value=10.0, W Word size=11, v Number of one-line descriptions=100, b Number of alignments to show=100, Matrix=BLOSUM62). The TIGR database, provides Tentative Contigs (TC) which are sequence predictions based on contig building from all known EST, from all known cDNA and from reconstructed mRNA. The TCs used for identification of the promoters of the present invention are represented by Table 1. In a second step these TCs were used to locate the corresponding gene on a genomic sequence, which gene comprises the coding region as well as the promoter region. Generally, these genomic sequences were SAC cones, which are represented herein by their Genbank accession number (see Table 1). From these BAC clones the sequence identity of the promoter region could be determined. TABLE 1 list of Arabidopsis promoters of the present invention. The promoter sequences are represented herein by their SEQ ID NO and promoter number (PRO). The coding sequences (CDS) naturally driven by a promoter of the present invention are represented by their name, by SEQ ID NO and by Tentative contig (TC) accession number of the TIGR database. The Genomic sequences (BAC clones or genes) comprising a promoter region of the present invention are represented by their Genbank accession number. Prom Prom CDS BAC clone SEQ ID NO number CDS name SEQ ID NO CDS TC (*or gene) 1 PRO0162 fructose bi-phopshate aldolase 10 TC149436 AL132969 2 PRO0143 Cyclin D2 11 TC156804 AC006592 3 PRO0144 Cyclin D3 12 TC163888 AL021961 4 PRO0161 rubisco activase 13 TC160636 + TC160635 M86720* 5 PRO0183 putative extensin 14 TC160873 + TC160875 AC004450 6 PRO0185 12S cruciferin AtCRU3 15 TC160506 AL021749 7 PRO0190 putative protein 16 TC149672 AL162651 8 PRO0193 FAD2 desaturase 17 TC160765 AP002063 9 PRO0194 G3PDH-like 18 TC160629 AC027134 Identification and Isolation of the Promoter Regions of Arabidopsis Genes

Starting from the sequence information of the genes and their location in the Arabidopsis genome, the promoter regions of these genes were isolated as the DNA region spanning about 1.2 kb upstream of the translation initiation codon (i.e. first ATG), which codon was excluded. When an intervening sequence such a s an intron, was present in the 5′ untranslated region of the gene, the isolated DNA region was taken as the region spanning about 1.2 kb plus the length of that intervening sequence. The promoter regions were isolated from genomic DNA of Arabidopsis thaliana via PCR using specific primers. These specific primers comprise AttB recombination sites, suitable for recombination cloning of the isolated promoter region. These specific primers are herein represented as SEQ ID NO 19 to 36 and are listed in Table 2. Conditions for PCR were as follows: 1 cycle of 2 min at 94° C., 35 cycles of 1 min at 94° C., 1 min at 58° C. and 2 min at 68° C., and 1 cycle of 5 min at 68° C. The length of the expected PCR fragment, which is the length of the promoter of the present invention, is also indicated in Table 2. The corresponding PCR fragment was purified from the PCR reaction mix via gel electrophoresis and subsequent purification using Zymoclean Gel DNA Recovery lit (Zymo Research, Orange, Calif.). TABLE 2 Overview of the primers used to isolate the Arabidopsis promoters of the present invention and the length of the Arabidopsis promoter regions. Promoter Promoter Prom Primer forward Primer Primer reverse Primer SEQ ID NO number length SEQ ID NO forward SEQ ID NO reverse 1 PRO0162 1223 19 prm3275 28 prm3276 2 PRO0143 1197 20 prm2943 29 prm2944 3 PRO0144 1219 21 prm2945 30 prm2946 4 PRO0161 1134 22 prm3273 31 prm3274 5 PRO0183 1239 23 prm4475 32 prm4476 6 PRO0185 1067 24 prm4479 33 prm4480 7 PRO0190 1178 25 prm4489 34 prm4490 8 PRO0193 2510 26 prm4498 35 prm4499 9 PRO0194 1050 27 prm4500 36 prm4501

Example 2 Cloning of Promoter-GUS Reporter Vectors for Plant Transformation

The purified PCR fragments of Example 1, corresponding to the promoter regions of the present invention, were cloned into the pDONR201 entry plasmid of the Gateway™ system (Life Technologies) using the “BP recombination reaction”. The identity and base pair composition of the cloned insert was confirmed by sequencing and additionally, the resulting plasmid was tested via restriction digests.

In order to clone each of the promoters of the present invention in front of a reporter gene, each entry done as mentioned above was subsequently used in an “LR recombination reaction” (Gateway™) with the destination vector p4582. This destination vector was designed to operably link each promoter of the present invention to the Escherichia coli beta-glucuronidase (GUS) gene via the substitution of the Gateway recombination cassette in front of the GUS gene. Furthermore this destination vector is suitable for transformation of plants and comprises within the T-DNA left and right borders the resulting promoter-GUS cassette and selectable marker and screenable marker cassettes (see FIG. 2). The resulting reporter vectors, comprising a promoter of the present invention operably linked to GUS, are subsequently transformed into Agrobacterium strain LBA4044 and subsequently into Arabidopsis plants using standard transformation techniques.

Example 3 Transformation of Arabidopsis thaliana with Promoter GUS Reporter Vectors

Sowing and Growing of the Parental Plants

For the parental Arabidopsis plants, approximately 12 mg of wild-type Arabidopsis thaliana (ecotype Columbia) seeds were suspended in 27.5 ml of 0.2% agar solution. The seeds were incubated for 2 to 3 days at a temperature of 4° C. and then sown. The plants were germinated under the following standard conditions: 22° C. during the day, 18° C. at night, 65-70% relative humidity, 12 hours of photoperiod, sub-irrigation with water for 15 min every 2 or 3 days. The seedlings that developed were then transplanted to pots with a diameter of 5.5 cm, containing a mixture of sand and peat in a ratio of 1 to 3. The plants were then further grown under the same standard conditions as mentioned above.

Agrobacterium Growth Conditions and Preparation

Agrobacterium strain C58C1 RIF containing helper plasmid pMP90 and a reporter vector with a promoter of the present invention, was inoculated in a 50 ml plastic tube containing 1 ml LB (Luna Broth) without antibiotic. The culture was shaken for 8-9 h at 28° C. Subsequently, 10 ml of LB without antibiotic was added to the plastic tube and shaken overnight at 28° C. after which the OD at 600 nm was measured. At an optical density of approximately 2.0, 40 ml of 10% sucrose and 0.05% Silwet L-77 (a chemical mixture of polyalkyleneoxide modified heptamethyltjisiloxane (84%) and allyloxypolyethyleneglycol methyl ether (16%), OSI Specialties Inc) was added to the culture.

Flower Dip

When each parental flowering plant had one inflorescence of 7-10 cm in height, the inflorescences were inverted into the Agrobacterium culture and agitated gently for 2-3 seconds. 2 inflorescences per transformation were used. Following this, the plants were returned to the normal growing conditions as described above.

Seed Collection

5 weeks after the flowers were dipped in the Agrobacterium culture, watering of the plants was stopped. The plants were incubated at 25° C. with a photoperiod of 20 hours. One week later, the seeds were harvested and placed in a seed drier for one week. The seeds were then cleaned and collected in 15 ml plastic tubes and stored at 4° C. until further processing.

Example 4 Evaluation of the First Generation of Transgenic Arabidopsis Plants Comprising the Promoter-GUS Reporter Construct

Growth and Harvest of Transgenic Plants or Plant Parts at Various Stages (C Plants, B Plants and A Plants)

For each transgenic line, seeds were incubated for 2 to 3 days at a temperature of 4° C. and then sown. The plants were grown under the following standard conditions: 22° C. during the day, 18° C. at night, 65-70% relative humidity, 12 hours of photoperiod, sub-irrigation with water for 15 min every 2 or 3 days. At the 2-leaves stage, about 10 plants were sacrificed and stained for GUS expression. These plants are named herein “C plants” for the purpose of evaluation. At the 10-leaves rosette stage, about 10 plants were sacrificed and stained for GUS expression. These plants are named herein “B plants” for the purpose of evaluation. The remaining plants were cultivated until seeds setting. These plants are named herein “A plants” for the purpose of evaluation. At that “A plant” stage, leaves, stems, siliques and seeds were sampled and stained for GUS expression. A plants were thus allowed to set seed, and the remaining seeds were used for confirmation of the expression pattern in plants of the second generation.

GUS Staining

The sacrificed plants or plant parts were covered with 90% ice-cold acetone and incubated for 30 min at 4° C. After 3 washes of 5 min with Tris buffer [15.76 g Trizma HCl (Sigma T3253)+2,922 g NaCl in 1 l bidi, adjusted to pH 7.0 with NaOH], the material was covered by a Tris/ferricyanate/X-Gluc solution [9.8 ml Tris buffer+0.2 ml ferricyanate stock (0.33 g Potassium ferricyanate (Sigma P3667) in 10 ml Tris buffer)+0.2 ml X-Gluc stock (26.1 mg X-Gluc (Europa Bioproducts ML 113A) in 500 μl DMSO)]. Vacuum infiltration was applied for 15 to 30 minutes. The plants or plant parts were incubated for up to 16 hours at 37° C. until development of blue colour was visible. The samples were washed 3 times for 5 minutes with Tris buffer. Chlorophyll was extracted in ethanol series of 50%, 70% and 90% (each for 30 minutes).

Expression Patterns of the Promoters of the Present Invention

The expression pattems of the Arabidopsis promoters of the present invention are summarized in Table 3. TABLE 3 expression patterns of the Arabidopsis promoters of the present invention PRO SEQ ID Promoter Expression pattern - NO number Promoter name specificity 1 PRO0162 fructose bi- constitutive phopshate aldolase 2 PRO0143 Cyclin D2 hydathode/meristem 3 PRO0144 Cyclin D3 meristem/dividing tissue/seed 4 PRO0161 rubisco activase shoot 5 PRO0183 putative extensin root 6 PRO0185 12S cruciferin seed AtCRU3 7 PRO0190 putative protein trichomes 8 PRO0193 FAD2 desaturase developing tissue preferred 9 PRO0194 G3PDH-like developing tissue preferred

The following paragraphs describe the observed expression patterns of the promoters of the present invention in more detail. The observations are based on the visual inspection of the GUS stained tissues as described above. It is to be understood that for some promoters expression may be weak and that expression in certain tissues may only be visible with very sensitive detection methods.

PRO0162, SEQ ID NO 1, Fructose bi-phopshate Aldolase

1 reporter construct (AT1221) was investigated and 8 C, 6 B and 8 A plants were analysed. C plants showed strong expression in the central cylinder of roots (100%). in root tip (75%). in meristem, in hypocotyl (100%) and in all young leaf tissues (100%). C plants showed also weak expression in old hydathodes of leaves (75%) and in cotyledons petioles (37%). B plants showed strong expression in roots (100%), in apical meristem, in hypocotyts (100%), in young leaves (100%). B plants showed also weak expression in old leaves (50%). A plants showed, strong expression in flowers (100%) but weak expression in petals (71%), strong expression in silique suture (86%) and strong expression in seeds. Therefore, promoter PRO162 is suitable for expression in all parts of the plant. This promoter is suitable as constitutive promoter and is especially active in young tissues.

PRO0143, SEQ ID NO 2, Cyclin D2

1 construct (AT1218), which is a reporter vector as described i n Example 2 comprising PRO0143 was investigated. 8 C plants, 6 B plants and 7 A plants were analysed. C plants showed strong expression in apical meristem (100%) and in hydathodes of young and old leaves (100%). B plants showed weak expression in hypocoty is (60%), strong expression in apical meristem (100%) and strong expression in hydathodes of young leaves (80%). A plants showed strong expression in stamen (86%). Therefore, promoter PRO0143 is suitable for expression especially in hydathodes and meristem, such as shoot meristem.

PRO0144,SEQ ID NO 3, Cyclin D3

1 construct (AT1220) was investigated and 7 C, 7 B and 8 A plants were analysed. C plants showed strong expression in the central cylinder of roots (86%). strong expression in meristem (86%), strong expression in all young leaves tissues (86%), strong expression in hydathodes of old leaves (86%), but no expression in cotyledons. B plants showed strong expression in the central cylinder of roots (100%) except root tip, strong expression in apical meristem, strong expression in hypocotyls (100%), strong expression in young leaves except trychomes (100%) and expression in hydathodes of old leaves (71%). A plants showed strong expression in pedicel, pistil (100%) and in cotyledons of embryos, but no expression in leaves or siliques, Therefore, promoter PRO0144 is suitable for expression in actively dividing tissues, such as meristems, root cylinder, hydathodes and seeds.

PRO0161, SEQ ID NO 4, Rubisco Activase

1 construct (AT1222) was investigated and 8 C, 5 B and 6 A plants were analysed. C plants showed strong expression in shoots (75-100%) and no expression in roots. B plants showed strong expression in shoot (100%) and weak expression in roots (40-60%). A plants showed strong expression in flower (100%), expression in siliques and leaves (33% 66%) and weak expression in embryos. It was conduded that PRO0161 is highly active in shoot with some leakiness in roots.

PRO0183, SEQ ID NO 5, Putative Extensin

1 construct (AT1316) was investigated and 7 C, 6 B and 7 A plants were analysed. C plants showed expression in roots, apical meristem and weak expression in hydatodes. B plants showed expression in the central cylinder of roots, in hypocotyls and apical meristem, but no expression in leaves. A plants showed expression in stamen and siliques, but no expression in leaves and no expression in seeds. Therefore, promoter PRO0183 is suitable as a root-referred promoter, with some weak expression in flowers and siliques.

PRO0185, SEQ ID NO 6, 12S Cruciferin A tCRU3

1 construct (AT1321) was investigated and 8 C, 7 B and 7 A plants were analysed. C plants showed no expression. B plants showed no expression in leaves and some leaky expression in roots. A plants showed no expression in shoots or leaves, but there was strong expression in seeds. Therefore, promoter PRO0185 is suitable as a seed-specific promoter.

PRO0190, SEQ ID NO 7, Putative Protein

1 construct (AT1391) was investigated and 8 C, B and 2 A plants were analysed. Expression was observed only in trichomes of C plants. No expression was observed in B plants or A plants. Therefore, promoter PRO0190 is suitable as a promoter specific for trichomes of young developing leaves.

PRO0193, SEQ ID NO 8, FAD2 Desaturase

1 construct (AT1317) was investigated and 8 C, 6 B and 8 A plants were analysed. C plants showed some expression in the central cylinder of roots (25-62%), but not in lateral roots, some expression in apical meristem (62%) and some expression in young leaves (50%). B plants showed strong expressi on everywhere (66-100%) except in trychomes and in old leaves, where the expression was rather low. A plants showed strong expression in seeds and some expression in pedicels, stamen (57-71%) and siliques (37-25%). A plants showed no expression in leaves. Therefore, promoter PRO0193 is suitable for expression in young developing tissues, including seeds. Promoter PRO0193 is considered to be constitutive in young active tissues.

PRO0194, SEQ ID NO 9, G3PDH-like

1 construct (AT1421) was investigated and 8 C, 7 B and A plants were analysed. C plants showed strong expression in the central cylinder of roots (71%) and in young leaves (100%), but no expression in cotyledons. B plants showed also strong expression in the central cylinder of roots (71%) and in young leaves (100%), but no expression in lateral roots or old leaves. B plants showed also expression in hydathodes of old leaves. A plants showed expression in flowers and siliques. Therefore, promoter PRO0194 is suitable for expression in young developing tissues, but not in mature tissues and seed.

Example 5 Stability of the Expression Patterns of the Promoters of the Present Invention in Further Generations

The above-mentioned analyses were performed on plants originating from the seeds of transformed flowers. The stability of promoter activity in the next generations or progeny plants of the original T0 plant, the so-called T1 and T2 plants, was evaluated as follows. The T0 plant transformed with the reporter constructs as mentioned in Example 2, were grown until maturity (A plants), of which the seeds (T1 seeds) were harvested and sown to generate progeny T1 plants. These plants were analysed as described above in Example 3 and the A T1 plants were allowed to reach maturity and to set T2 seeds.

The expression pattern of the promoters of the present invention was studied in T0 plants, T1 seeds, T1 plants and T2 seeds and in all the tissues as described in Example 3. The specific expression patterns as reported from the T0 and T1 seeds and described in Example 4 were confirmed in the following T1 generation and T2 seeds. It is concluded that the expression pattern of the promoters of the present invention are stably inherited in plants of subsequent generations.

Example 6 Stability of Expression Patterns of the Promoters of the Present Invention in Other Plants

The above-mentioned plant analyses were performed on Arabidopsis thaliana plants. This choice was based on the practical consideration that Arabidopsis thaliana plants are good model plants for many dicots and monocols. The reporter constructs comprising the promoters according to the present invention are also transformed into other plants, such as rice or corn, and these transformed plants are evaluated as described hereinabove. The expression patterns of the promoters according to the present invention are conserved among plants. Therefore, the promoters according to the present invention are also suitable for driving and/or regulating expression of an operably linked nucleic acid in monocots, such as rice or corn. 

1-18. (canceled)
 19. An isolated promoter capable of driving and/or regulating expression, comprising: (a) an isolated nucleic acid as represented by any one of SEQ ID NO 1 to 9 or the complement of any one of SEQ ID NO 1 to 9; or (b) an isolated nucleic acid having at least 90% sequence identity with any one of the DNA sequences as represented by any one of SEQ ID NO 1 to 9; or (c) an isolated nucleic acid specifically hybridizing under stringent conditions with any one of the DNA sequences as represented by any one of SEQ ID NO 1 to 9; or (d) an isolated nucleic acid as defined in any one of (a) to (c), which is interrupted by an intervening sequence; or (e) a fragment of any one of the nucleic acids as defined in (a) to (d), which fragment is capable of driving and/or regulating expression.
 20. A promoter according to claim 19, which is a hybrid promoter comprising at least one part of a promoter as defined in claim 19 and further comprising a part of another promoter.
 21. A genetic construct comprising: (a) an isolated promoter as defined in claim 19; and (b) a heterologous nucleic acid sequence operably linked to said promoter of (a); and optionally (c) a 3′ transcription terminator.
 22. An expression cassette comprising a genetic construct as defined in claim
 21. 23. A host cell comprising an isolated promoter as defined in claim 19, or genetic construct as defined in claim 21, or an expression cassette as defined in claim
 22. 24. Host cell according to claim 23, selected from a bacteria, algae, fungi, yeast, plant, insect and animal host cell.
 25. A transgenic plant cell comprising an isolated promoter as defined in claim 19, or a genetic construct as defined in claim 21, or an expression cassette as defined in claim
 22. 26. Transgenic plant cell according to claim 25, which is a monocot plant cell or a dicot plant cell.
 27. A transgenic plant comprising a transgenic plant cell as defined in claim
 26. 28. A transgenic plant according to claim 27, wherein said plant is selected from rice, maize, wheat, barley, millet, oats, rye, sorghum, soybean, sunflower, canola, sugarcane, alfalfa, bean, pea, flax, lupinus, rapeseed, tobacco, tomato, potato, squash, papaya, poplar and cotton.
 29. Plant part, preferably a harvestable part, a propagule or progeny of a plant as defined in claim
 26. 30. Method for driving and/or regulating expression of a nucleic acid in a plant or plant cell, comprising: (a) operably linking said nucleic acid to a promoter as defined in claim 19, and (b) introducing the resultant genetic construct into a plant or plant cell.
 31. Method according to claim 30, wherein said expression is constitutive or tissue-specific.
 32. Method for the production of a transgenic plant, comprising: (a) introducing into a plant cell an isolated promoter as defined in claim 19, or a genetic construct as defined in claim 21, or an expression cassette as defined in claim 22, and (b) cultivating said plant cell under conditions promoting plant growth. 